Sentence Extraction-Based Machine Reading Comprehension for Vietnamese
Authors
Abstract
The development of natural language processing (NLP) in general, and machine reading comprehension (MRC) in particular, has attracted great attention from the research community. In recent years, a few large datasets for MRC tasks in Vietnamese have appeared, such as UIT-ViQuAD and UIT-ViNewsQA. However, these datasets are not diverse enough in their answers to serve the research. In this paper, we introduce UIT-ViWikiQA, the first dataset for evaluating sentence extraction-based MRC in the Vietnamese language. UIT-ViWikiQA is converted from the UIT-ViQuAD dataset and comprises 23,074 question-answer pairs based on 5,109 passages from 174 Wikipedia articles. We propose a conversion algorithm to create the dataset and three types of approaches for sentence extraction-based MRC in Vietnamese. Our experiments show that the best model is XLM-R$_{Large}$, which achieves an exact match (EM) of 85.97% and an F1-score of 88.77% on our dataset. Besides, we analyze the experimental results in terms of question type and the effect of context on the performance of the MRC models, thereby showing the challenges of the proposed dataset.
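The abstract does not describe the conversion algorithm itself, so the following is only a minimal sketch of one plausible way to turn a span-extraction example into a sentence-extraction example: find the sentence that contains the original answer offset and use that sentence as the new answer. The function names `split_sentences` and `span_to_sentence`, the regex-based sentence splitter, and the example passage are all assumptions made for illustration, not the authors' method.

```python
import re


def split_sentences(context):
    """Return (start, end, text) for each sentence in `context`.

    Assumption: sentences end at '.', '!' or '?'. This naive rule will
    mis-split abbreviations (e.g. "TP. Hồ Chí Minh"); a real pipeline
    would use a proper Vietnamese sentence segmenter.
    """
    spans = []
    for match in re.finditer(r"[^.!?]+[.!?]*\s*", context):
        chunk = match.group()
        text = chunk.strip()
        if not text:
            continue
        # Skip any leading whitespace so offsets point at the first character.
        start = match.start() + (len(chunk) - len(chunk.lstrip()))
        spans.append((start, start + len(text), text))
    return spans


def span_to_sentence(context, answer_start):
    """Map a character-level answer offset to the sentence containing it.

    Returns a dict with the sentence text and its start offset, or None
    if the offset does not fall inside any detected sentence.
    """
    for start, end, text in split_sentences(context):
        if start <= answer_start < end:
            return {"text": text, "answer_start": start}
    return None


# Hypothetical example passage (not taken from the dataset).
context = "Hà Nội là thủ đô của Việt Nam. Thành phố nằm bên sông Hồng."
span_answer = "sông Hồng"
sentence_answer = span_to_sentence(context, context.index(span_answer))
print(sentence_answer["text"])
```

Under this sketch, the question and passage stay unchanged and only the answer field is widened from a short span to its enclosing sentence, which is what makes the task sentence extraction rather than span extraction.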
Similar resources
Modelling Reading Times in Bilingual Sentence Comprehension
Relatively little is known about the interaction between a bilingual’s two languages beyond the word level. This paper investigates the issue by comparing word reading times (RTs) in both L1 and L2 to quantitative predictions by statistical language models. Recurrent neural networks are trained on either a Dutch corpus, an English corpus, or the two corpora combined (i.e., the bilingual network...
Stochastic Answer Networks for Machine Reading Comprehension
We propose a simple yet robust stochastic answer network (SAN) that simulates multistep reasoning in machine reading comprehension. Compared to previous work such as ReasoNet, the unique feature is the use of a kind of stochastic prediction dropout on the answer module (final layer) of the neural network during the training. We show that this simple trick improves robustness and achieves result...
A case for the sentence in reading comprehension.
PURPOSE This article addresses sentence comprehension as a requirement of reading comprehension within the framework of the narrow view of reading that was advocated in the prologue to this forum. The focus is on the comprehension requirements of complex sentences, which are characteristic of school texts. METHOD Topics included in this discussion are (a) evidence linking sentence comprehensi...
S-Net: From Answer Extraction to Answer Generation for Machine Reading Comprehension
In this paper, we present a novel approach to machine reading comprehension for the MS-MARCO dataset. Unlike the SQuAD dataset that aims to answer a question with exact text spans in a passage, the MS-MARCO dataset defines the task as answering a question from multiple passages and the words in the answer are not necessary in the passages. We therefore develop an extraction-then-synthesis frame...
Evaluating Machine Reading Systems through Comprehension Tests
This paper describes a methodology for testing and evaluating the performance of Machine Reading systems through Question Answering and Reading Comprehension Tests. The methodology is being used in QA4MRE (QA for Machine Reading Evaluation), one of the labs of CLEF. We report here the conclusions and lessons learned after the first campaign in 2011.
Journal
Journal title: Lecture Notes in Computer Science
Year: 2021
ISSN: 1611-3349, 0302-9743
DOI: https://doi.org/10.1007/978-3-030-82147-0_42